Peirce ’ s i and Cohen ’ s κ for 2 × 2 Measures of Rater Reliability
نویسندگان
چکیده
This study examined a historical mixture model approach to the evaluation of ratings made in “gold standard” and two-rater 2 × 2 contingency tables. Peirce’s i and the derived i average were discussed in relation to a widely used index of reliability in the behavioral sciences, Cohen’s κ. Sample size, population base rate of occurrence, the true “science of the method”, and guessing rates were manipulated across simulations. In “gold standard” situations, Peirce’s i tended to recover the true reliability of ratings as well as better than κ. In two-rater situations, iave tended to recover the true reliability as well as better than κ in most situations. The empirical utility and potential theoretical benefits of mixture model methods in estimating reliability are discussed, as are the associations between the i statistics and other modern mixture model approaches.
منابع مشابه
Assessing reperfusion with whole-brain arterial spin labeling: a noninvasive alternative to gadolinium.
BACKGROUND AND PURPOSE Arterial spin labeling (ASL) is a perfusion imaging technique that does not require gadolinium. The study aimed to assess the reliability of ASL for evaluating reperfusion in acute ischemic stroke in comparison with dynamic susceptibility contrast (DSC) imaging. METHODS The study included 24 patients with acute ischemic stroke on admission and 24-hour follow-up ASL and ...
متن کاملUse of hand-held Doppler ultrasound examination by podiatrists: a reliability study
BACKGROUND Hand held Doppler examination is a frequently used non-invasive vascular assessment utilised by podiatrists. Despite this, the reliability of hand-held Doppler has not been thoroughly investigated. Given the importance of Doppler in completing a vascular assessment of the lower limb, it is essential to determine the reliability of the interpretation of this testing method in practici...
متن کاملInter-rater reliability of AMSTAR is dependent on the pair of reviewers
BACKGROUND Inter-rater reliability (IRR) is mainly assessed based on only two reviewers of unknown expertise. The aim of this paper is to examine differences in the IRR of the Assessment of Multiple Systematic Reviews (AMSTAR) and R(evised)-AMSTAR depending on the pair of reviewers. METHODS Five reviewers independently applied AMSTAR and R-AMSTAR to 16 systematic reviews (eight Cochrane revie...
متن کاملOne-two-triage: validation and reliability of a novel triage system for low-resource settings
OBJECTIVES To validate and assess reliability of a novel triage system, one-two-triage (OTT), that can be applied by inexperienced providers in low-resource settings. METHODS This study was a two-phase prospective, comparative study conducted at three hospitals. Phase I assessed criterion validity of OTT on all patients arriving at an American university hospital by comparing agreement among ...
متن کاملBladder Prolapse Configuration on Baseline Standing Cystogram Can Predict Anterior Vaginal Wall Suspension Procedure Outcomes.
OBJECTIVE To evaluate whether bladder prolapse shape on lateral voiding cystourethrogram (VCUG) is an accurate predictor of anterior vaginal wall suspension (AVWS) procedure outcomes. METHODS Following an institutional review board approval, preoperative lateral standing VCUG views from a prospectively maintained database of women who underwent AVWS for stage ≥2 cystocele were reviewed retros...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010